On the Entropy of Random Fibonacci Words 



JOHAN NiLSSON 



Centre for Mathematical Sciences, Lund University, LTH 
Box 118, SE-221 00 LUND, SWEDEN 
j ohcinnQmaths . 1th . se 



Abstract 



The random Fibonacci chain is a generalisation of the classical Fi- 
bonacci substitution and is defined as the rule mapping H> 1 and 
1 I— )■ 01 with probability p and 1 i— >■ 10 with probability 1 — p for 
< p < 1 and where the random rule is applied each time it acts on a 
1. We show that the topological entropy of this object is given by the 
growth rate of the set of inflated random Fibonacci words. 

MSC2010 classiflcation; 68R15 Combinatorics on words, 05A16 Asymptotic enu- 
meration, 37B10 Symbolic dynamics. 



In [2j Godreche and Luck define the random Fibonacci chain by the 
generalised substitution 



for < p < 1 and where the random rule is applied each time 9 acts on a 1. 
They introduce the random Fibonacci chain when studying quasi-crystalline 
structures and tilings in the plane. In their paper it is claimed without proof 
that the topological entropy of the random Fibonacci chain is given by the 
growth rate of the set of inflated random Fibonacci words. We give here a 
proof of this fact. 

Before we can state our main theorem we need to introduce some no- 
tation. A word w over an alphabet S is a finite sequence wiW2 ■ ■ -Wn 
of symbols from E. We let here S = {0, 1}. We denote a sub-word or 
a factor of w by w[a,b] = WaWa+iWa+2 ■ ■ ■ wt^iWb and similarly we let 
VF[a, 6] = {t(;[a, 5] : w € W}. By | • | we mean the length of a word and 
the cardinality of a set. Note that \w[a,b] \ = b — a + 1. 

For two words u = uiU2Us . . .Un and v = viV2Vs ... t;™, we denote by uv 
the concatenation of the two words, that is, uv = U1U2U3 . . . UnViV2 ■ ■ ■ Vm- 
Similarly we let for two sets of words U and V their product be the set 
UV = {uv : u € U,v G V} containing all possible concatenations. 

Letting 9 act on the word repeatedly yields us an infinite sequence of 
words r„ = 0"^^(O). We know that ri = and r2 = 1. But is either 01 or 
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10, with probability p or 1—p respectively. The sequence {rn}^=i converges 
in distribution to an infinite random word r. We say that is an inflated 
word (under 6) in generation n and we introduce here sets that correspond 
to all inflated words in generation n; 

Definition 1. Let Aq := 0, Ai := {0} and A2 := {1} and for n > 3 we 

define recursively 

An := An-lAn-2 U A„_2^n-1 (2) 

and we let A := lim„^oo ^n- 

We shall later on, as a direct consequence of Proposition [3 see that the 
set A is a well defined set. It is clear from the definition of An that all 
elements in An have the same length, that is, for all x,y An we have 
|x| = \y\. We shall frequently us the length of the elements in An, and 
therefore we introduce the notation 

fn '■= \x\ for X € An, n > 1 

and for completeness we set /o := 0. From the recursion ([2]) and the defini- 
tion of fn we have immediately the following proposition 

Proposition 2. The numbers fn are the Fibonacci-numbers, that is, /o = 0, 
/i = 1 and fn = + fn-2 forn>2. 

For a word w we say that x is a factor of w if there are two words u, v 
such that w = uxv. The factor set F{S,n), of a set of words S, is the set 
of all factors of length n of the words in S. We introduce the abbreviated 
notation Fn for the special factor sets of the random Fibonacci words, 

Fn:=F{A,fn). 

It seems to be a hard problem to give an explicit expression for the size of 
We have to leave that question open, but we shall give a rough upper 
bound of 

The topological entropy of the random Fibonacci chain is defined as the 
limit lim„^oo ^ log \F{A, n)\. The existence of this limit is direct by Fekete's 
lemma [1] since we have sub-additivity, log \F{A,n + m)\ < log |F(74, n)| + 
log \F{A,m)\. We can now state the main result in this paper 

Theorem 3. The logarithm of the growth rate of the size of the set of 
inflated random Fibonacci words equals the topological entropy of the random 
Fibonacci chain, that is 

\og\An\ log|F„| 
hm = hm — . (3) 

n^oo /„ n^oo 

The outline of the paper is that we start by studying the set An and 
restate some of the results obtained by Godreche and Luck. Thereafter we 
continue by looking at sets of prefixes of the set An- Next we give a finite 
method for finding the factor set Fn and finally we present an estimate of 
\Fn\ in terms of \An\, leading to the proof of Theorem [3l 
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2 Inflated Words 

In this section we present the sets of inflated random Fibonacci words 
and give an insight to their structure. For some smah values of n we have 
directly from the definition 

^i = {0}, A2 = {1}, ^3 = {01, 10}, ^4 = {011,101,110}, 

_ f 01011,01101,01110,10011, 
10101,10110,11001,11010 

For a word x we denote the reversed word of x by x'^ , that is, if x = 
x\X2 ■ ■ - Xn then x'^ = x„x„-_i . . . xi. Similarly we write A!^ = {x^ : x G A^}. 

Proposition 4. For n > 1 we have An = A^^. 

Proof. We give a proof by induction on n. The basis cases n = 1 and 
n = 2 are clear as Ai = {0} and A2 = {!}■ Now assume for induction that 
the statement of the proposition holds for 2 < n < p. Using ([2]) and the 
induction assumption give 

Ap^i = [ApAp—i U Ap—iAp) 

— J^p_i^p U J±pJ±p_i 

— Ap—iAp U ApAp^i 
= ^p+i) 

which completes the induction. □ 

Now let us turn to the question of the size of An- In the recursive 
definition ([2]) of An we have that the overlap is 

AnAn—l n An-lAn = ^„_2^}i-3^n-2 • 

Knowing this we obtain the recursion for n > 3 

\An\ = 2\An-l\\An-2\ - |^„-2 P | A„-3 1 , (4) 

with 1 = 0, I All = 1 and \A2\ = 1. The recursion was given by 
Godreche and Luck in |2]. They also gave the following proposition, which 
we for completeness restate here with its short proof. 

Proposition 5 ([2]). With \Ai\ = 1 and \ A2\ = I we have for n> 3 

Tl — 1 

\An\ = ^j^n-lPn-2|- (5) 

n — 1 
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Proof. Let = |^„|/(|^„_i||^n-2|)- Then we can rewrite the recursion 
di]) into 

an = 2 

with the initial condition 03 = 2. We claim that a„ = ^5^. The proof of the 
claim is a straight forward induction. The basis is clear, and if we assume 
the claim being true for n = p we get for n = p + 1 

1 _ 1 _ p 
Op+i — 2 — 2 — — j- — -, 

which completes the induction and the proof of the proposition. □ 

Without explicitly stating it, the following proposition was also given by 
Godreche and Luck in [2]. We present it here, filling in the details. 

Proposition 6 ([2j). For n > 3 we have 

\An\ = {n-l)l[{n-iy'-\ (6) 

i=2 

Proof. We give a proof by induction on n. For the basis steps, n = 3 and 
n = 4 we have 1 = 2 • 1*^ = 2 and \Ai\ = 3 • 2*^ • 1^ = 3. Now assume for 
induction that ([6]) holds for 4 < n < p. Then from ([5]) and the induction 
assumption we have 



I^P+i| 



^ \A \\A A 
I -^p 1 1 ^p- 1 1 



p — 1 

-iip-i)iiip-i)f^-A (p-2) n ((^-1) 



p 



4=2 / \ 1=2 

\ /(p-l)-l 



p{p-2)[Yi{p-iy^-'\ { n (p - + 

\i=2 / \ i=2 

p{p-2) (jlip - iy^-'^ (jj{p - j 

p{p- 2)-^! {p - 2)-^» (jlj-'^ ~ iV''^'^^''''^ 



/p-i 



p{p- iy° {p - n ~ 



a=3 
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=p{p-iy°{p-2y^ n(^~ ~ 

\i=4 

/(P+l)-l 

=p\ n (b+i)-^/'" 

y i=2 

which completes the induction. □ 

The sequence (see Tablefflon pageE]) is the sequence A072042 

in the On-line Encyclopedia of Integer Sequences |3j. 
From Proposition [6] we have 



In In \ 



= — log(n - 1) + ^ ^^log(n - i). (7) 

In Jn 

For large n the Fibonacci number /„ can be approximated by -i=( i+^v^ )". 
Hence the sum in ([7|) converges when letting n tend to infinity and therefore 
clearly implies the existence of the limit. A numerical calculation in Maple 
gives 

lim ^^^^IM. ~ 0.444399 ^ log 1.559553. 
3 Prefixes 

In this section we look at and present some properties of sets of prefixes 
of the inflated random Fibonacci words. 

Proposition 7. For n > 3 and k > we have 

v4„[l,/n-l] = ^n+fc [!,/«- 1] (8) 

and symmetrically 

An[2, fn] = ^n+kifn+k — fn + '^i fn+k]- (9) 

Proof. We put our initial attention to equality Let us first consider the 
case with k = 1. We will give a proof by induction on n. For the basis we 
have 

.43[1,1] ={0,1} = ^4[1,1] and yl4[l, 2] = {01,10,11} =A[1, 2]. 



5 



J. NiLSSON / On the Entropy of Rand. Fib. Words 



Now assume that Q holds for 4 < n < m. In the induction step n = m + 1 
we get 

^m+l[l,/m+l — 1] C {Am+lAm U A„iAm+l)[l, fm+1 — 1] 
= Am+2[^, fm+1 — !]• 

For the reversed inclusion let x S Am+2[^, fm+i — !]• By the recursion 
identity ^ we have to look at two cases. First, if x G (Am+i^m)[l, /m+i — 1] 
then we clearly have x £ Am+i[l, fm+i — 1] and we are done. Secondly, we 
have to deal with the case x G {AmAm+i)[l, fm+i — !]• For this case we use 
the induction assumption twice to obtain 

^m-l[l,/m-l — 1] = ^m[l,/m-l — 1] 

= ^m+l[l, /m-1 — !]• 

By the help of the above equality it follows that 

{AmAm+l)[^, fm+l " 1] = (^m+1 [1, /m-1 — 1]) 

= (^m— 1 [1) /m— 1 ~ 1]) 
= (^m^m-l) [1, /»n+l — 1] 
C {^AraAjn-l U Ajn—iAm) [1, fm+1 ~ 1] 
= Ajn+l[l, fm+1 - 1], 

which completes the induction. 

Now let us turn to the case with k > 1. We shall also prove this by 
induction. We proved the basis step, k = 1, above. Now we assume that ([8]) 
holds for k = p. For A: = p + 1 we have by what we proved above 

^n+p[l) fn+p ~ 1] = ^n+p+lil) fn+p ~ !]• 

But then also An+p[l, fn — 1] = ^n+p+i[l,/n — 1] and therefore by our 
induction assumption 

A„[l, /„-!]= An+p[l, fn-l]= A„+p+i[l, /„-!], 

which completes the proof of ([8]) . Equality ([9]) is direct from ([8]) by symme- 
try. □ 

Proposition 8. For n> A we have 

An C [An-l[lJn-l-l]){^,l?[An-2%fn-2]). (10) 
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Proof. Denote the right hand side set in (|10|) by B„. If n = 4 we have 
B4: = {Asll, 1]){0, 1}^ = {0, 1}^, which clearly is a superset of A4. If n > 5 
then by Proposition [7] we have = Bn[l,fn-i] and An[fn-i + 

2,/„] = An-2[2,fn-2] = -Bn[/n-i + 2,/„]. As Bn by definition consist of 
all words starting with a prefixes in An[l) fn-i] and ending with a suffix in 
An[fn-i + 2, fn] it must Contain all sequences from An- □ 

Note that there clearly is a symmetric analogous of the result of Propo- 
sition [8l 



4 Factor Sets 



The aim of this section is to give a finite method for finding the factor 
set Fn- For some small values of n we have 

F{As, /2) = {0, 1}, F(A4, /a) = {01, 10, 11}, 

F(^5, /4) = {001, 010, Oil, 100, 101, 110, 111}, 

00101, 00110, 00111, 01001, 01010, 
01011, 01100, 01101, OHIO, 01111, 
F^AgJ^) = I 10010,10011,10100,10101,10110, >. 

10111, 11001, 11010, 11011, 11100, 
11101,11110 

We shall shortly see that for n > 4 the above sets F{An+i, fn) actually 
is Fn- It is clear from above that F^ 7^ ^(^4,73) as words in ^(^5,74) 
have the sub-word 00, while this sub-word is not in F^A^, f^)- By computer 
calculation we can find the size of F{An-^-i, fn) for some additional small 
values of n, see Table [1] on page [TTJ 

Proposition 9. For k > 1 and n> A we have 

F{An+l,fn)=F{An+k,fn)- (H) 



Proof- We give a proof by induction on k- For the basis step, k = 2, we see 
that F{Ai+i, fi) = F(^4_|_2, fi), (this is direct as the word 000 is not a sub- 
word of an element in F(Aq, /s)). Hence we may assume that n > 5. From 
the recursion ([2]) we have directly that F{An+i, fn) C F{An+2, fn)- For the 
reversed inclusion we have by the recursion identity ([2]) and symmetry that 
it is enough to show that any sub- word of length /„ in An+iAn is also a 
sub- word in ^„,+i. Let us define Xi = x\i-,i — \ + /„] where x G An+iAn and 
1 < i < /„+i + 1. 

For 1 <i < fn-i + 1 we have directly that sub- word in An+i- 

For fn-i + 2 < i < 2/„_i we have by the definition of Xi that it is a 
sub-word in An+iAn[fn-i + 2,/„+i + fn-i — 1]- (The inequalities defining 
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the case are valid if /„,-i > 3, which requires that n > 5). By Proposition [7] 
and the recursion ([2]) we have 

{An+lAn) [fn-1 + 2, fn+1 + fn-1 " 1] = 

= {An+l[fn-l + 2, fn+l]) {An[l, fn-1 " 1]) 

= {An[2,fn]){An-l[l,fn-l-l]) 

= {AnAn-l)[2, fn+1 - I] 

C {AnAn-1 U An-lA„) [2, f„,+l " 1] 

= An+l[2, fn+1 — 1], 

which gives that Xi IS 3j sub- word of An+i- 

For 2 fn-1 <i< fn+1 we have that Xi is a sub-word in {An+iAn)[fn + 
2, fn+2\- Proposition [7] and the recursion ([2]) now gives 

{An+lAn) [fn + 2, fn+2] = (^n.+l[/n + 2, fn+l]) An 

= {An-l[2,fn-l])An 
= {An-lAn)[2, fn+l - 1] 
C {AnAn-1 U An-lAn) [2, fn+l " 1] 
= An+l[2, fn+l — 1], 

and again we see that Xi is a sub- word of An+i- 

For finally i = fn+i + 1 we have that Xj is an element in A^ and therefore 
also a sub- word in An+i, which concludes the proof of the basis step. 

Assume for induction that equality (jlip holds for k = p. For the induc- 
tion step, k = p + 1, we have from what we just proved 

F{An+p, fn+p) = F{An+p+l, fn+p)- 

But then we must also have F{An+p, fn) = F{An+p+i, fn)- The induction 
assumption now gives = F{An+p,fn) = F{An+p+i, fn), which 

proves the proposition. □ 

Proposition 10. For n > 4 we have Fn = F{An+i, fn)- 

Proof. It is clear that F{An+i-, fn) C Fn- For the converse let x £ Fn- Then 
there are two words u and v such that uxv € Am for some m > n + 1, that 
is X G F{Am, fn)- Proposition [9] now gives F{Am, fn) = F{An+i, fn)- □ 

5 Upper Bound 

The aim of this section is to give an upper bound of the size of \Fn\ in 
terms of \An\- 

Proposition 11. For n > 3 and 1 < k < fn — 1 we have 

\An[l,k]\ - \An[k + lJn]\ <4"-Vn|- (12) 
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Proof. It is clear from the symmetric structure of A„ that we only have to 
give a proof for the case 1 < k < [^fn\- We give a proof by induction on n. 
For the basis case n = 3 we have 

\A3[1, 1]| • \As[2, 2]| = |{0, 1}| • [{0, 1}| = 2 • 2 < 4 • 2 = 4^'Ms\. 

Similarly we have for n = 4 

|^4[1,1]| • |A4[2,3]| < 1^41 • 1^41 = 3 • 1^41 < 4^-2|yl4| 

and for n = 5 with k = 1,2 

\A5[k,l]\ ■ \A5[k + 1,5]\ < I^sl ■ l^sl =8 - l^sl <45-2|A5|. 



Now assume for induction that ()12|) holds for 5 < n < p. For the induction 
step, n = p + 1, we have from Proposition [71 Proposition [51 the induction 
assumption and ([5|) that 

\Ap+i[l,k] \ ■ \Ap+i[k + 1, fp+i] \ = 

= \Ap[l,k]\ ■ \Ap+i[k + 1, fp+i]\ 

< \Ap[l, k]\-\ fp - 1]) {0, 1}2 {Ap_i[2, [k + 1, /p+i] 

= \Ap[l,k]\-\{Ap[k + l,fp-l]){0,l}^Ap^i[2,fp_i])\ 

< \Ap[l,k]\ ■ \Ap[k + l,fp]\-2^-\Ap_i[2,fp_i]\ 

< 4P-^ ■ \Ap\ ■ 4 ■ \Ap_i\ 
<4(*^+i)-2|^,+i|, 

which completes the induction. □ 

The estimate in Proposition [TT] seems to be far off from the true upper 
bound of the left hand side in ([T2]). If we let 



\An[l,k]\ ■ \An[k + l,fn]\ 

Cn = max ■ — — ; 

l<fc</„-l \An\ 

then computer calculations (see Table [Tj on page [TT]) motivate the following 
conjecture 

Conjecture 12. There is a constant C such that for all n > 3 and 1 < k < 

fn — 1 we have 

\An[l,k]\ ■ \An[k+l,fn]\ < C ■ \An\. 

Prom our computer calculations (see Table [T] on page [TT]) we further 
conjecture that we may find C < 3, but we have to leave this question open 
as well as the more delicate question of finding the optimal value of such a 
constant, Cq = sup„>3C„. 

We can now give the estimate we set out to find. Compared to computer 
calculations our estimate seems to be far from the optimal one, but it will 
be sufficient for our purpose. 
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Proposition 13. For n >3 we have 

<2(4"-2/„_i + l) \An\. 

Proof. For n = 3 we have 

IF3I = 4 < 2(4 • 1 + 1) • 2 = 2(43-2/3-1 + 1)|^3|. 

For n > 4 we have from Proposition [10] that Fn = F{An+i, fn). Therefore 
it is enough to give an estimarte of the number of subwords of length /„ 
there are in the words in ^n+i- Let us start by considering the number of 
subwords in the subset AnAn-i- We define Xk = x[k,k — 1 + fn], where 
X G AnAn-i for 1 < k < fn-i + 1- Clearly we have Xk ^ Fn- If A; = 1 
then we have xi € An and we see that there are \An\ different xis. If 
2 < A; < fn-i + 1 we have by Proposition [7] 

Xk G {An[k, /„])(A„_i[l, A; - 1]) = {An[k, fn]){An[l, k-l]). 

Proposition [11] now gives that there are at most 4"'~2|^„| different x^'s for 
each of these A;'s. Adding up the estimates and include the symmetric case, 
X G An-iAn, completes the proof. □ 



6 Proof of the Theorem 



Finally, we have now gathered enough background to prove our theorem 

Proof of Theorem [3 We have already dealt with the existence of both limits 
in ([3|). Since clearly A„ C Fn it now follows from Proposition [TS] 

logl^nl . log|-Fn| 

hm < lim 



fn fn 

log (2(4"-2/n-l + l)|^n| 



< lim 

n->oo /„ 

^ ^. log(4"-i2"|A„|) 

< lim 

n^oo 

3nlog2 \og\Ar. 

< lim h lim — 

n->oo /„ n^co 

lim 



which completes the proof. □ 
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7 Numerics 

We present here the output from our computer calculations, which were 
obtained by a small JAVA-program on a standard PC. 



Table 1 



n 


fn 






|i^(^n+l,/n)| 



















1 


1 


1 


2 


1 




2 


1 


1 


2 


2 




3 


2 


2 


4 


3 


2.0 


4 


3 


3 


7 


7 


2.0 


5 


5 


8 


22 


22 


2.0 


6 


8 


30 


108 


108 


2.13333 


7 


13 


288 


1356 


1356 


2.11111 


8 


21 


10080 


65800 


65800 


2.17143 


9 


34 


3317760 


30139200 


30139200 


2.16389 
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